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The present invention concerns diagnostic and therapeutic means for 
5 pathologies associated with kidney stones. 

Nephrolithiasis is a common multifactorial disorder characterized by 
the presence of calculi (stones) in the urinary tract; an important genetic 
contribution to its pathogenesis has been established; its aetiology is 
unknown (Jaeger 1996; Curhan etal. 1997; Baggio 1999; Scheinman 1999). 
10 Kidney stones are estimated to affect 10% of the population (Serio and 
Fraioli 1999; Rivers et ai. 2000). The major classes of stones are calcium 
oxalate, calcium phosphate, uric acid, struvite, and cystine. Uric acid 
nephrolithiasis (UAN) accounts for 20% of all stones. Formation of calculi 
may occur when the urine becomes overly concentrated with uric acid that 
15 may complex to form small crystals and subsequently stones. However, the 
molecular basis for urate handling in the human kidney is not completely 
clear. 

The authors of the invention have previously identified in patients in 
the small Sardinian village of Talana a locus of approximately 2.5 cM 

20 potentially correlated with UAN and located on chromosome 10q21-q22 
(Ombra et al. 2001). However, the locus cannot be used for screening 
assays on populations, nor for the development of diagnostic assays, nor to 
define therapeutic agents. It is therefore vital to identify the genomic region 
associated with the pathology. 

25 To identify the region, the authors have extended the search to a 

larger sample of affected individuals selected from the same study group. 
Using mapping methods based on sharing of alleles and/or haplotypes 
identical by descent or on autozygosity, a linkage disequilibrium (LD) block of 
about 200 kb was found in the Talana population and a core region of 67 kb 

30 already associated with UAN. The authors have identified in the associated 
region a novel gene, called ZNF365, divided into 15 exons spanning a 
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genomic region of about 300 kb and encoding for at least four different 
protein isoforms of 407, 333, 462 and 216 amino acids. The encoding 
sequence for the protein isoform of 216 amino acids was found to reside 
completely within the core interval of 67 kb. The protein has at least one 

5 transmembrane domain and various sites for N-terminal N- and O- 

glycosylation, suggesting that it may be a membrane protein. Mutation 
analysis showed that a mutation of a coding nucleotide, which causes a 
missense in exon 12 (Ala62Thr), is strongly associated with UAN (p=0.0096). 
Moreover, the variant protein has a different secondary structure and might 

10 be involved in the aetiology of UAN. 

Hence, the object of the present invention is a nucleic acid comprising 
at least one fragment of the human gene ZNF365, in which the fragment 
encodes for a functional fragment of at least one of the proteins of the 
ZNF365 group to be used in the diagnosis of pathologies associated with 

15 kidney stones. 1 

A further object of the invention is a nucleic acid comprising at least a 
fragment of the human gene ZNF365, in which the fragment encodes for a 
functional portion of at least one of the proteins of the ZNF365 group for 
therapeutic use in pathologies associated with kidney stones. 

20 A further object of the invention is a method to detect in an individual 

at least one mutation of the gene encoding for one of the proteins of the 
ZNF365 group located on chromosome 10, comprising the phases: 

- collection of a sample containing a sufficient quantity of DNA from the 
aforesaid individual or reproducible in culture; 

25 - isolation of the DNA of the collected sample; 

- submission of the isolated DNA to exponential amplification using as an 
primer pair for the amplification reaction at least two oligonucleotides that are 
able to amplify at least one fragment of the human gene ZNF365, in which 
the fragment encodes for a functional portion of at least one of the proteins of 

30 the ZNF365 group; 
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- detection of any mutations in at least one amplified fragment compared with 
healthy controls. 

Preferably, the DNA exponential amplification phase is performed 
using primer pairs able to amplify a part of the fragment encoding the human 
5 gene ZNF365. More preferably, the DNA exponential amplification phase to 
amplify a part of the fragment encoding the human gene ZNF365 comprises 
the use of the following primer pairs: 

Ala62Thr-F: 5' CTC CAC TCC ACC TTT TTA AG 3' 
Ala62Thr-R: 5' GCT GAC ATT GGT ACT TAC TG 3\ 
10 More preferably, the detection phase of any mutations in at least one 

amplified fragment compared with healthy controls is performed using direct 
sequencing. An expert from the field will know which alternative methods 
come under the range of protection of the invention. 

A further object of the invention is a diagnostic kit for pathologies 
15 associated with kidney stones and comprises: 

- at least one pair of oligonucleotide primers for the exponential amplification 
reaction of at least one fragment of the human gene ZNF365, in which a 
fragment encodes for a functional portion of at least one of the proteins of the 
ZNF 365 group; 

20 - a control DNA from a healthy individual not affected by pathologies 
associated with kidney stones. 

Preferably, the oligonucleotide primer pairs in the kit for performing the 
amplification reaction are able to amplify a part of the region encoding the 
ZNF365 gene. 

25 A further object of the invention is a protein belonging to the group of 

ZNF365 proteins or a functional portion thereof for use in the diagnosis of 
pathologies associated with kidney stones. 

A further object of the invention is a protein belonging to the group of 
ZNF365 proteins or a functional portion thereof for use in the treatment of 

30 pathologies associated with kidney stones. 
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The invention is described below with broad examples and in - 
reference to the following figures: 

Figure 1. Physical and transcriptional map of the UAN critical region. 
Markers of the physical map are indicated vertically above the solid bar 
5 representing genomic DNA. Microsatellites used in linkage analysis are 
D10S1719, D10S1652 and D10S1640. All genes and ESTs located in this 
region are shown. 

Figure 2. SNPs map of the LD block. 

Figure 3. Genomic structure of the ZNF365 gene showing alternative 

10 splicing and alternative start sites that generate four different transcripts. 
Initiation codons (ATG) and stop codons (TAA or TGA) are shown for all 
isoforms. Two different promoters, a CpG island (P1) and a TATA box (P2) 
are indicated with arrows. Exons used for each transcript are indicated as 
colored boxes, while the LD block is indicated as a grey box. Two missense 

15 variants found in exons 5 and 12 are shown. 

Figure 4. Relationship between the amino acid sequences of ZNF365 
proteins. Grey boxes denote identical amino acids. Solid lines indicate the 
C2H2 zinc finger domain and coiled-coil fragments. The predicted 
transmembrane domains are shown in the rectangle. 

20 Figure 5. Results of the association study. The molecular markers 

used in the study are indicated at the top. The position on the physical map 
and the association results are indicated for each marker. The different 
configurations of the associated haplotypes are shown at the bottom. 

Figure 6. Secondary structure predictions using the SecStr package 

25 are shown. The most probable structures, predicted by three or more 
methods, are colored black in the joint prediction histograms. (A) The 
structural prediction for the non associated allele (ala 62). (B) The structural 
prediction for the uric acid nephrolithiasis susceptibility allele (thr62). 
Subjects and methods 

30 Sample collection 



WO 2004/035828 



5 



PCT/IT2003/000627 



All individuals participating in the study came from an isolated village 
(Talana) in central Sardinia (1200 inhabitants) that is characterized by slow 
population growth, high endogamy, and high inbreeding (Wright et al. 1999; 
Angius et al. 2001). The diagnosis of UAN was clinically confirmed by 
5 accurate physical examination and by renal ultrasonography. The original 
patient list included 134 affected subjects. For this study we selected. 62 
patients (mean age 58.4 years) with severe UAN. All subjects gave signed 
informed consent, and all samples were taken according to the Helsinki 
Declaration. 

10 SNP genotyping 

To find common SNPs in the UAN critical region, we performed an 
extensive alignment of genomic sequences deposited in public databases 
using Sequencher software. The screening panel included 10 unrelated 
persons from the Talana population. PCR amplicons of 800-1000 nt 

15 containing the selected putative polymorphisms were designed using the 
Oligo 4.0 software program. PCR reactions were prepared using 50 ng of 
genomic DNA template, 0.5 pM of each PCR-primer, 1.5 mM MgCI 2 , 0.2 mM 
of each dNTP, 0.5 U Taq polymerase and the buffer recommended by the 
supplier (Amersham) in a final volume of 25 pi. Thermo cycling started with a 

20 single denaturation step for 2 min at 94°C, followed by 35 cycles of 
denaturation for 30 sec at 94°C, annealing for 30 sec at Temperature Melting 
(TM) and extension at 72°C for 45 sec. One final extension step was added 
for 7 min at 72°C. 

Samples were then sequenced using the Big Dye Terminator Ready 
25 Reaction Kit (Applied Biosystem). Sequencing reactions were performed on a 
9700 Thermal Cycler (Applied Biosystems) for 25 cycles at 95°C for 10 sec, 
TM for 5 sec and at 60°C for 2 min. After the sequencing, each reaction was 
isopropanol-precipitated. Sequencing of the products was performed on the 
ABI prism 3100 Genetic Analyser (Applied Biosystems). Polymorphisms were 
30 detected by multiple alignments of sequences using the Autoassembler 
program (Applied Biosystems). 
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Mutational analysis of all 15 exons of the identified ZNF365 gene was 
performed using PGR, including the intron/exon boundaries. 

Genotyping of the single nucleotide polymorphisms was performed by 
dot-blot hybridisation of the PCR products, comprising the polymorphisms, 
5 with short allele-specific oligonucleotides (ASO probes) (Ristaldi et al. 1989). 
cDNA clones 

ZNF365A (KIAA0844 accession number ABO20651; 
http://www.kazusa.or.jp/huge/) was provided by the Kazusa DNA Research 
Institute. ZNF365B cDNA sequence was obtained by sequencing the cDNA 

10 clones 3069791 and 4821260 from the IMAGE Consortium. ZNF365C and 
ZNF365D cDNA sequences were obtained by RT-PCR/TOPO-cloning vector 
(Invitrogen) strategy using primer pairs in different exons of the gene on 
human kidney RNA (Clontech) and 5- and 3'-RACE strategy carried out 
using a human kidney Marathon-Ready cDNA kit. 

15 RNA expression studies 

A human multiple-tissue Northern blot (Clontech) was hybridized with 
ZNF365A, 6, C and D transcripts. The Northern blot was prehybridized, 
hybridized, and washed according to the manufacturer's directions 
(Clontech). 

20 RT-PCR analysis: 0.5 pg of human kidney (Clontech), adrenal gland 

(Clontech) and blood mRNA were reverse-transcribed in a 50 pi reaction 
mixture containing 1x RT buffer (20 mM Tris-HCI pH 8.4, 50 mM KCI, 2.5 mM 
MgCI 2 ), 10 mM DTT, 0.5 mM dNTP, 0.2 pg of random hexamers (Roche), 
and 200 units of Superscript reverse transcriptase (Invitrogen). After 60 min 

25 incubation at 37°C, 1 pg of DNase-free RNase was added and incubated 10 
min at 37°C. The cDNA was extracted twice with phenol/chloroform/iso- 
amylalcohol (25:24:1). The cDNA was then precipitated overnight. with 1/10 
vol of 3 M sodium acetate, pH 5.2, and 2.5 vol of ethanol. 10 ng .of each 
cDNA were used as templates in a 25 pi PCR reaction containing 1x PCR 

30 buffer (Amersham), 0.2mM dNTPs, 0.5 units of AmpliTaq polymerase 
(Amersham), and 0.5 pM each of the primer sequences derived from the 
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cDNA. Using a DNA Thermal Cycler 9700 (Applied Biosystems), we carried 
out 40 amplification cycles for 30 sec at 94°C, 30 sec at TM, and 45 sec at 
72°C. 

5' and 3' RACE strategy was carried out by using 5 pi of Marathon 
5 human kidney cDNA (Clontech), 1 x PCR buffer (Amersham), 0.2mM dNTPs, 
1.0 units of AmpliTaq polymerase (Amersham), and 0.5 uM each of the gene 
specific primer and AP1 primer (Clontech) in a final volume of 50 ul. 25 
amplification cycles were performed for 30 sec at 94°C, 20 sec at TM, and 4 
min at 68°C. Nested PCRs were carried out with the same protocol described 

1 0 for the first PCRs using 1/250 of the first PCRs, gene specific primers 2 and 
AP2 primer (Clontech) in a final volume of 50 pi. 30 amplification cycles were 
carried out for 30 sec at 94°C. 20 sec at TM, and 4 min at 68°C. Aliquots of 
the first and second PCR reactions performed with different gene specific 
primers were run on agarose gel and blotted on nylon membrane according 

15 to the manufacturer's directions (Amersham). The membranes were 

hybridized with a PCR product including exons 2, 3, 4 and 5 of the ZNF365D 
transcript. Positive PCR products were cloned in TOPO-vector (Invitrogen) 
and sequenced with M13 forward and reverse primers. 
RESULTS 

20 Transcriptional map of UAN locus 

We obtained a consensus genomic sequence of the 1.1 Mb region, 
corresponding to the identified 2.5 cM critical interval, by alignment of partial 
sequences deposited in different databases. The 1 .1 Mb interval was found 
to contain at least 6 novel genes including 2 uncharacterised ones (Figure 1). 

25 MRF-2 (modulator recognition factor 2) gene is encoded by 1 0 exons and 
spans a region of about 200 kb. This gene may be an ortholog of mouse 
Desrt. Homozygous mutants of this gene are developmentally and sexually 
retarded and have transient immune abnormalities (Lahoud et al. 2001). 
RTKN-L (rhotekin like) gene is a homologue of RTKN gene, an inhibitor of 

30 rho GTPase activity, located on chromosome 2 (Fu et al. 2000). RTKN-L 
consists of at least 10 exons and spans a region of 75 kb. Given their 
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biological function, these two genes were not obvious candidates for uric acid 
nephrolithiasis. The EST 603251916F1 sequence (GenBank Acc. BI603606) 
belongs to the UniGene cluster Hs.252954. By using the longest mRNA 
sequence of this UniGene cluster, we found that this gene is split in at least 
5 five exons and spans a region of 50 kb. The EST hd42c05.x1 (GenBank Acc. 
AW51 1012) exhibits no significant similarity to any ESTs deposited in public 
databases, suggesting that the gene is present at very low levels in human 
cells. KIAA0844 cDNA is derived from the HUGE protein database (Nagase 
et al. 2000). Finally, the EST 603040095F1 (GenBank Acc. BI822044) 
1 0 encodes for five exons and spans a small region of 1 5 kb. In the absence of 
a functional assignment, it is not possible to know which one of these 4 
orphan genes may play a role in the pathogenesis of UAN, even though the 
last EST is the closest to marker D10S1652 already found to be associated 
with UAN. 

15 Molecular characterization of the UAN susceptibility gene 

All previously described genes except the EST 603040095F1 were 
located outside the LD block and so none of them were further analysed. In 
order to characterize the 5 J and 3' regions of this gene, we performed RT- 
PCR and RACE strategies starting from the coding region of the EST 

20 603040095F1 . An extensive gene spanning a 300 kb region and overlapping 
the LD block was disclosed. Genomic and cDNAs sequence comparisons 
showed that the gene consists of 1 5 exons and includes the EST 
603040095F1 inside the critical region (delimited by AFM214zb6) and 
KIAA0844 outside it (Figure 1). In the human kidney, where we worked on a 

25 complex pattern of alternative splicing and transcriptional start sites, different 
proteins of 407, 333, 462 and 216 amino acids are generated (Figure 3, 
Table 1). 
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Table 1 
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ZNF365A transcript - KIAA0844 cDNA (4158 bp) is encoded by 5 
exons and its putative promoter region contains a CpG island. The open 
5 reading frame (ORF) of 1224 bp encodes a putative protein of 407 amino 
acids with a molecular weight of 46558 Da. The N-terminal region of this 
protein (amino acids 26-51) harbours the classical zinc finger domain of 
C2H2 family (Figure 4). This domain is found in numerous nucleic acid- 
binding proteins and has also been used in protein-protein interactions and 

10 membrane associations (Laity et al. 2001). This gene was renamed 
ZNF365A (zinc finger protein 365) by the HUGO Gene Nomenclature 
Committee. A secondary structure prediction for this protein suggests the 
presence of four predicted alpha-helical coiled-coil domains frequently used 
as an oligomerization motif. 

1 5 ZNF365B transcript - This transcript of 1 671 bp, was identified by EST 

analysis (associated ESTs are 602688686F1 and UI-H-BW1-amf-e-12-0- 
Ul.sl) and, like ZNF365A, was located outside the critical region. Unlike 
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ZNF365A, this transcript utilizes different exons 4 and 5 (Figure 3). The 
encoded protein of 333 amino acids is identical to the ZNF365A protein in the 
N-termina! region (exons 1-3) containing the zinc finger domain and coiled- 
coil fragments but it is different at the C-terminal (Figure 4). 
5 ZNF365C transcript - ZNF365C was isolated by RT-PCR analysis 

using different combinations of primers anchored in different exons of the 
gene. This transcript of 3376 bp overlaps the critical region. It uses the first 4 
exons and the same CpG island promoter of ZNF365B transcript outside the 
critical region and 4 different exons (9,13,14 and 15) located in the LD block 

10 (Figure 3). The encoded protein of 462 amino acids is similar to the ZNF365B 
protein but has a different C-terminal region. Computer-assisted analysis of 
this isoform based on transmembrane domain prediction revealed at least 
two membrane-spanning domains in this novel Oterminal region from amino 
acid 344 to 395, suggesting that it may be an integral membrane protein 

15 (Figure 4). 

ZNF365D transcript - ZNF365D spans a region of about 150 kb and 
uses its own specific TATA-box promoter. All coding exons of this transcript 
are completely included in the LD block associated with UAN (Figure 3). Full- 
length cDNA was obtained by 5' and 3* RACE strategies. The composite 

20 cDNA sequence of 2695 bp encodes a protein of 216 amino acids that is 
completely different from the proteins encoded by ZNF365A and S, but 
identical to the ZNF365C protein in the C-terminal region (Figure 4). 
Prediction analysis revealed a strong transmembrane domain in this isoform 
at positions 126-149, suggesting that, like ZNF365C, it could be an integral 

25 membrane protein with N-termini outside the cell. In this non cytosolic portion 
several potential modification sites were detected, including possible N and 
Q-iinked glycosylation sites (at Asn-20, Asn-82, Thr-88, Thr-89, Ser-90, Ser- 
91, Ser-93, Thr-97), suggesting that this outer-membrane protein portion may 
be glycosylated. In addition, several cysteine residues that may participate in 

30 disulfide bond formation were disclosed at positions 14, 27, 112, 113, 142, 
183, and 207. Finally, three possible protein kinase C sites at Ser-163, Ser- 
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165, Ser-167 at C-termini were shown (Figure 4). These latter consensus 
sites may be located intracellular^. 

The protein isoforms of 407, 333, 462 and 216 amino acids revealed 
no significant homology to other proteins deposited in public databases, 
5 suggesting that they could be a new class of undisclosed proteins. ZNF365B, 
C, D, revealed no signals by northern analysis, but RT-PCR assays showed 
low levels of expression in the blood, the kidney and the adrenal gland 
tissues. In particular, ZNF365D transcript was expressed in blood tissues 10 
times less than GAPDH gene expression in a real-time PCR experiment. 

10 A specific ZNF365D allele with an altered structure 

Sequencing of all exons of ZNF365 and part of the intronic regions 
was performed on eight selected patients carrying the putative high-risk 
nephrolithiasis susceptibility haplotype. Two variants causing missense in the 
exon 5 (ZNF365A transcript) were identified that lead to serine/alanine amino 

1 5 acid substitution (variant Ser337Ala) and in the exon 12 (ZNF365D transcript) 
that lead to alanine/threonine amino acid substitution (variant Aia62Thr) 
(Figure 3). Moreover, the authors identified a number of polymorphisms in 
the UTR regions and in intronic regions of the gene. 

An analysis was performed to determine whether the presence of 

20 these alleles confers increased risk to UAN. A comparison of allelic 
frequencies in UAN patients and controls showed a strong association 
between UAN and variant Ala62Thr (p=0.0096; empirical p=0.0052). When 
haplotype frequencies in cases and controls including the identified variants 
were compared, we observed increased evidence of association with UAN 

25 for the various haplotype configurations (p=0.0205; p=0.0239; p=0.0280; 
p=0.0306) (Figure 5). 

Computer-assisted analysis strongly predicted that the ala62thr 
variation induces secondary structural alteration of the ZNF365D protein in 
the third alpha helical loop (Figure 6). These results suggest that the effect of 

30 threonine causes a significant conformational change that may have 
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important implications for the biological function of this protein or its 

interaction with other proteins. 
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